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Abstract 



In this article we introduce the operations of insertion and deletion working in a random-context 
and semi-conditional manner. We show that the conditional use of rules strictly increase the com- 
putational power. In the case of semi-conditional insertion-deletion systems context-free insertion 
and deletion rules of one symbol are sufficient to get the computational completeness. In the ran- 
dom context case our results expose an asymmetry between the computational power of insertion 
and deletion rules: systems of size (2, 0, 0; 1, 1, 0) are computationally complete, while systems of 
size (1, 1, 0; 2, 0, 0) (and more generally of size (1, 1, 0;p, 1, 1)) are not. This is particularly inter- 
esting because other control mechanisms like graph-control or matrix control used together with 
insertion-deletion systems do not present such asymmetry. 



1 Introduction 



Insertion-deletion systems are a powerful theoretical computational device which is based on two elemen- 
tary operations of insertion and deletion of substrings in a string. These operations were first considered 
with a linguistic motivation in [18] and latter developed in [8j [23] . These references investigate Marcus 
contextual grammars which capture many interesting linguistic properties like ambiguity and duplication. 

Another motivation for these operations can be found in (TUJ E] where the insertion operation and its 
iterated variant is introduced as a generalization of Kleene's operations of concatenation and closure [14] . 
The operation of concatenation would produce a string xyz from two strings xy and z. By allowing the 
concatenation to happen anywhere in the string and not only at its right extremity a string xzy can be 
produced, i.e., z is inserted into xy. In [12j the deletion is defined as a right quotient operation which 
happens not necessarily at the rightmost end of the string. 

The third inspiration for insertion and deletion operations comes, surprisingly, from the field of 
molecular biology. In fact they correspond to a mismatched annealing of DNA sequences, see [55] 
for more details. Such operations are also present in the evolution processes under the form of point 
mutations as well as in RNA editing, see the discussions in [3j [U [27] [25] . This biological motivation 
of insertion-deletion operations led to their study in the framework of molecular computing, see, for 
example, [S [H 123 [2H] ■ 

In general, an insertion operation means adding a substring to a given string in a specified (left and 
right) context, while a deletion operation means removing a substring of a given string from a specified 
(left and right) context. A finite set of insertion-deletion rules, together with a set of axioms provide 
a language generating device: starting from the set of initial strings and iterating insertion-deletion 
operations as defined by the given rules, one obtains a language. 

Even in their basic variants, insertion-deletion systems are able to characterize the recursively enumer- 
able languages, see [3D] for an overview of known results. Moreover, as it was shown in [15], the context 
dependency may be replaced by insertion and deletion of strings of sufficient length, in a context-free 



manner. If the length is not sufficient (less or equal to two) then such systems are not able to generate 
more than the context-free languages and a characterization of them was shown in [29] . 

Similar investigations were continued in |20[ 1151 116] on insertion-deletion systems with one-sided 
contexts, i.e., where the context dependency is asymmetric and is present only from the left or only 
from the right side of all insertion and deletion rules. The papers cited above give several computational 
completeness results depending on the size of insertion and deletion rules. We recall the interesting fact 
that some combinations are not leading to computational completeness, i.e., there are languages that 
cannot be generated by such devices. 

Similarly as in the case of context-free rewriting, it is possible to consider a graph-controlled variant of 
insertion-deletion systems. Thus the rules cannot be applied at any time, as their applicability depends on 
the current "state" , changed by a rule application. Such a formalization is rather similar to the definition 
of insertion-deletion P systems [24] . One-sided graph-controlled insertion-deletion systems where at most 
two symbols may be present in the description of insertion and deletion rules were investigated in [7] . This 
correspond to systems of size (1, 1, 0; 1, 1, 0), (1, 1, 0; 1, 0, 1), (1, 1, 0; 2, 0, 0), and (2, 0, 0; 1, 1, 0), where the 
first three numbers represent the maximal size of the inserted string and the maximal size of the left and 
right contexts, resp., while the last three numbers represent the same information for deletion rules. It 
is known that such systems are not computationally complete [IT, while the corresponding P systems 
and graph-controlled variants are. A summary of these results can be found in [1] 130] . 

Further adaptation of control mechanisms from the area of regulated rewriting leads us to matrix 
insertion-deletion systems |21j . where insertion and deletion rules are grouped in sequences, called matri- 
ces, and either the whole sequence is applied consecutively, or no rule is applied. It was shown that in the 
case of such control the computational power of systems of sizes above is strictly increasing. Moreover, 
binary matrices suffice to achieve this result giving a characterization similar to the binary normal form 
for matrix grammars. 

In this paper we continue the investigation on the "regulated" insertion-deletion. We adapt the 
idea of semi-contextual and random context grammars to these systems. More precisely, a permitting 
and forbidding condition is associated to each insertion and deletion rule. In the case of semi-contextual 
control the conditions are based on sets of words, while in the random context case the conditions are sets 
of letters. A rule can be applied if all the words from the permitting condition are present in the string, 
while no word from the forbidding condition is present. We show that the computational completeness can 
be achieved by semi-conditional insertion-deletion systems whose permitting and forbidding conditions 
contain words of length at most 2 and using context-free insertion and deletion of one symbol, i.e. rules 
of size (1,0,0; 1,0,0). This is quite interesting result showing that the semi-conditional control is very 
powerful - even in the case of graph-controlled insertion-deletion systems with appearance check only 
recursively enumerable sets of numbers can be obtained with same insertion and deletion parameters. 

Until now all the results in the area of insertion-deletion systems (with or without control mechanisms) 
exhibited the property that the computational completeness is achieved or not at the same time by 
both systems of size (n,m,m']p,q,q') and (p,q,q';n,m,m'), i.e. when the parameters of insertion and 
deletion are interchanged. This empirical property suggested that with respect to the computational 
completeness the insertion and deletion rules are similar (even if the proofs use quite different ideas). 
This article shows for the first time that this property does not always hold, the counterexample being 
random context insertion-deletion systems of size (2, 0, 0; 1, 1, 0) which generate all recursively enumerable 
languages, while systems of size (1, 1, 0; 2, 0, 0) (and more generally of size (1, 1, 0;p, 1, 1)) cannot do this. 

2 Preliminaries 

We do not present the usual definitions concerning standard concepts of the theory of formal languages 
and we only refer to |26j for more details. The empty string is denoted by A. The family of recursively 
enumerable, context-sensitive and context-free languages is denoted by RE, CS and CF, respectively. 
We will denote the length of a string w with \w\; we will refer to the number of occurrences of the symbol 
a in the string \w\ by the notation \w\ a . 

In the following, we will use special variants of the Geffert normal form for type-0 grammars (see [9] 
for more details). 

A grammar G — (N, T, S, P) is said to be in Geffert normal form [S] if the set of non-terminal symbols 
N is defined as TV = {S, A, B, C, D}, T is an alphabet and P only contains context-free rules of the forms 
S — > uSv with u G {A, C} + and v G (T U {B, D}) + as well as S — >• A and two (non-context-free) erasing 
rules AB A and CD -t X. 



We remark that in the Geffert normal form we can easily transform the linear rules (suppose that 
A, B, C and D are treated like terminals) into a set of left-linear and right-linear rules (albeit by 
increasing the number of non-terminal symbols, e.g., see |24j). More precisely, we say that a grammar 
G = (N, T, S, P) with N = N' U N", S, S' G N', and N" = {A, B, C, D}, is in the special Geffert normal 
form if, besides the two erasing rules AB — > A and CD —> A, it only has context-free rules of the following 
forms: 

X -> bY, where X, Y G N' , b G 2V", 1^7 
A^Y"6, where A,F G iV',6 G TUN", AT ^ Y" 
5' -> A. 

Moreover, except rules of the forms X — > Sb and X — > S'b, we may even assume that for the first 
two types of rules it holds that the right-hand side is unique, i.e., for any two rules X — > w and U — > w 
in P we have U = X. 

The computation in a grammar in the special Geffert normal form is done in two stages. During the 
first stage, only context-free rules are applied. During the second stage, only the erasing rules AB — > A 
and CD — » A are applied. These two erasing rules are not applicable during the first stage as long as 
the left and the right part of the current string are still separated by S (or S') as all the symbols A and 
C are generated on the left side of these middle symbols and the corresponding symbols B and D are 
generated on the right side. The transition between stages is done by the rule S' — > A (which corresponds 
to the rule S — > A from the Geffert normal form). We remark that all these features of a grammar in 
the special Geffert normal form are immediate consequences of the proofs given in [5] . 

Since during the first stage symbols A, B, C and D act like non-terminals, if no confusion arises, we 
consider that during this stage there is only one non-terminal and there are only left- and right-linear 
rules. 

We give below the definitions of semi-conditional and random-context grammars, as given in |26) . 

A semi- conditional grammar is a quadruple G = (N,T, S, P), where N,T,S are specified as in a 
context-free grammar and P is a finite set of triples of the form p = (A — > w; E, F), where A — > w is a 
context-free production over NUT and E, F are finite subsets of (N U T) + . Then, p can be applied to a 
string x G (N U T)* only if A appears in x, each element of E and no element of F is a subword of x. If 
E or F is the empty set, then no condition is imposed by E, or respectively, F. E is said to be the set 
of permitting and F is said to be the set of forbidding context conditions of p. 

We remark that in some sources, e.g. |22) . the definition of semi-conditional grammars implies that 
l-^l < T 1-^1 < 1- We consider the definition from [26] where the above restriction is not present. We 
denote by SCij the family of semi-conditional grammars where |iui| < i, \w2\ < j, for any wi G E and 
W2 G F for any rule (^4 — > w; E, F) of the grammar. 

A random context grammar is a semi-conditional grammar where E, F C N. 

Semi-conditional grammars generate all recursively enumerable or all context-sensitive languages, 
depending on whether A-rules are used or not, respectively. 

2.1 Insertion-deletion systems 

An insertion-deletion system is a construct ID = (V,T, A, I, D), where V is an alphabet; T C V is the 
terminal alphabet (the symbols from V\T are called non-terminal symbols); A C V* is the set of axioms; 
I,D are finite sets of triples of the form (it, a, v), where u, a (a ^ A), and v are strings over V. The 
triples in / are insertion rules, and those in D are deletion rules. An insertion rule (u, a, v) G / indicates 
that the string a can be inserted between u and v, while a deletion rule (u, a, v) G D indicates that a 
can be removed from between the context u and v. Stated in another way, (u, a, v) G / corresponds to 
the rewriting rule uv — > uav, and (u,a,v) G D corresponds to the rewriting rule uav — > uv. By =>i„ s 
we denote the relation defined by the insertion rules (formally, x =>i ns y if and only if x — x\uvx2 , y — 
x\uavx2, for some (u, a,v) £ I and Xi,x% G V*), and by ^dei the relation defined by the deletion rules 
(formally, x =>de/ y if and only if x = X\uavx2,y = X1UVX2, for some (u,a,v) G D and x±,X2 G V*). By 
=> we refer to any of the relations =>i ns , =>dei, & n d by =>* we denote the reflexive and transitive closure 
of=K 

We will often consider R = IUD and distinguish between insertion and deletion rules by the subscripts 

ins Or del • 

The language generated by ID = (V, T, A, I, D) is defined by 

L(ID) — {w G T* I x =4>* w for some x G A}. 



The complexity of an insertion-deletion system ID = (V, T, A, I, D) is described by the vector 
(n,m,m';p,q,q') called size, where 



n = max{|a| | (u, a, v) E /}, p = max{|a| | (it, a, v) E D}, 
to = max{|it| | (u, a, v) E /}, q = max{|u| | {u, a, v) E D}, 
m! = max{|i;| | (u, a, v) E /}, q' = max{|i;| | (u,a,v) E D}. 

The total size of an insertion-deletion system ID of size (n, to, m';p, q, q') is defined as the sum of all 
the numbers from the vector: T,(ID) = n + m + m'+p + q + q'. 

By INS™' m DEL™ we denote the families of languages generated by insertion-deletion systems 
having the size (n,m,m';p,q,q r ). 

If one of the parameters n,m,m! ,p,q,q' is not specified, then instead we write the symbol *. In 
particular, INS*' DEL®' denotes the family of languages generated by context-free insertion- deletion 
systems. If one of the numbers from the pairs m, to' and/or q, q' is equal to zero (while the other one is 
not), then we say that the corresponding families have a one-sided context. Finally we remark that the 
rules from / and D can be put together into one set of rules R by writing (u, a, v) ins for (u, a, v) El 
and (u,a,v) del for (tt, a,v) E D. 

2.2 Conditional insertion-deletion systems 

In a similar way to context-free grammars, insertion-deletion systems can be extended by adding some 
additional controls. We discuss here the adaptation of the idea of semi-conditional and random context 
grammars for insertion-deletion systems and define corresponding types of insertion-deletion systems. 
A semi-conditional insertion- deletion system of degree > 1 is a construct 

7 = (V,T,A,R), where 

• V is a finite alphabet, 

• T C V is the terminal alphabet, 

• A C V* is a finite set of axioms, 

• R — {n, . . . , r n }, n > 1 is a finite set of rules of the form : ((a, b, c) t ,P, Q), where (a, b, c), is 
an insertion rule if t = ins or a deletion rule if t = del, P C V 1 , Q C V- 7 ', 1 < k < n. 

The sentential form (also called configuration) of 7 is a string w E V* . For : ((a, b, c) t ,P,Q) a 
transition w => rk w' , for 1 < k < n, is performed if w => t w' (t E {ins, del}) and for all x E P, x is a 
subword of w and for no y E Q, y is a subword of w. 

The language generated by 7 is defined by 

£(7) ~ {w ET* \ x =>* w for some x E A}. 

For i, j > 1 we denote by SCi.jINS™ ,m ' DEL™' the families of languages generated by semi- 
conditional insertion-deletion systems of degree (i,j) having insertion and deletion rules of size 
(n,m,m';p,q,q r ). 

We also define random context insertion-deletion systems as semi-conditional insertion-deletion sys- 
tems where i=j = 1. They are denoted with RC INS™'" 1 ' DEL™' . 

3 Semi-conditional insertion-deletion systems 

In this section we show that semi-conditional insertion-deletion systems of degree (2, 2) are computa- 
tionally complete even if rules inserting or deleting a single symbol are used. We start by showing that 
that this result is optimal with respect to the size of insertion and deletion rules. 

Theorem 1. SC 2 ,2lNS°{° DEL ' C RE. 

Proof. Let 7 = (T,T, A, R), 7 E INS ' DEL ' be an insertion-deletion system. Consider the following 
context-free grammar G = ({S},T,S,P), P = {S -> SaiS ...Sa n \ a 1 ...a n E A} U {S SaS 
a E T} U {S — > A}. Let G' be the grammar obtained from G by the elimination of A-productions. 
Since A ^ L(j), G' does not contain any erasing rules. This combined with the fact that the family of 
semi-contextual grammars without A-productions is included in CS yields the result. □ 



Theorem 2. S C '2,2 1 'N S°'° DEL°{° = RE. 

Proof. We will start by proving the inclusion SC2.2IN Si'° 'DEL® D RE. To achieve this goal, we 
will show the inclusion SC2.2IN S 0,0 DEh\'° D XRC ac , where XRC ac is the family of random context 
grammars. 

Consider an arbitrary random context grammar G — (V,T, 5, R). The set R contains context-free 
rules with permitting and forbidding contexts. 

We can assume that for any random context rule r : (A — > u,P,Q) of G, either \u\ = 2, or \u\ = 0. 
Indeed, any rule r : (A — > uiu 2 ■ ■ ■ u n , P, Q) can be replaced by a set of following random context rules: 

{(A -> u{W?\P, Q U Q w )} U {(W r W -»• u i+l W? +1 \ 0, 0)|1 < i < n - 1}U 
U{(W r (n) -> A, 0,0)}. 

The set above is defined as follows: 

= {W r W) I 3r : (B -> /3, P', Q') e P, 1 < j < n r }, 

where n r is the number of symbols in the right-hand side of the rule with label r. 

It can be easily seen that the rules above will simulate the context-free rule A — > u\...u n . The 
context checking is performed during the application of the rule A — > uiWi 1 ^. The presence of Qw in 
the forbidding set ensures that once started, the sequence of related rules shall be terminated. 

Before proceeding to the simulation we will do some preliminary considerations. First of all, the 
central problem with SC22I ~N S 0,0 *D E 'L°'° systems is that there is no direct way to check the context 
of a specific symbol in the string. To address this problem, we encode every symbol a e V with a 
pair of symbols da and place special terminators B and E at the beginning and the end of the string 
correspondingly. Every rule will have a special forbidding context which will check whether the string 
has this form. We will refer to this forbidding context as to the normalization condition. 

Now, to operate at a specific locus in the string, we insert some service symbol and use the per- 
mitting/forbidding contexts of the subsequent rules to check whether it is located in the appropriate 
position. We then insert and delete symbols as we need, and the normalization condition included in 
every rule will assure that the operations we are performing are only happening in some neighborhood 
of the service symbols: whenever the proper organization of the string is broken, no rules will be further 
applicable, thus blocking the computation at a string which contains non-terminal symbols. 

Having done the overview of our approach, we are ready to present the solution itself. 

Let c : V — > (V U V) 2 be a coding defined as c(a) = da, a e V. 

The normalization condition is the following forbidding condition added to the forbidding contexts 
of the majority of rules: 

Q M = {£y, xy \ x, y G V} U {Bx, xE \ x G V} U {uB, Eu \ u £ V U V}. 

It can be seen that if Vi = V U V U {B, E} then V{ \ Qm = {B(xy)+E | x, y e V}. Indeed, the first 
group of restrictions requires that the string only contains an alternating sequence of hatted and barred 
symbols, the third group requires the string to begin by B and to end by E, while the second group 
states that the first symbol after B has a hat and the symbol before E has a bar. A string of the form 
above is said to be in a normalized form. 

We construct the following semi-conditional insertion-deletion system T — (Vr, 7r, Ar, Rr)- 
The terminal alphabet Tr — {a \ a e V} contains the hatted versions of original terminals. The 
alphabet Vr includes contains the service symbols, the hatted and barred versions of every symbol in the 
alphabet of G, and the new terminal symbols: 

Q # = {# p |3p: (A->X,P,Q) G R}, 

Q$ = I 3q: (A ->a,P,Q) €R,a^X,l<i < 5}, 

Vr = Q# UQ$ U {a, a | a e V} UT r . 

The only axiom of the insertion-deletion system is A-p = {BSSE}. 

Note that we are defining the new terminal alphabet Tr instead of re-using the old T only for the 
sake of keeping the description simple and that this change does not affect the result itself. 

The set of rules Rr is constructed in the following way: 



• for every rule p : (l4A,P,Q) G R we add to i?r the rules 



p.l : ((A, # p , A) ms , {11} U P, Q # U Q$ U Q U Qat) 

p.2: ((A, X, \) deU {# P X},Q N ) 

p.3 : ((A, X, \) del , {# p X}, {X# p } U Qjv) 

V A : ((A, # p , A) dei ,0,Q^), 

• for every rule q : (X -> FZ, P, Q) G P, X, Y, Z G V we add to R r the rules 

g.l :((A, $«, A) ms ,{il}UP,Q # UQ,UQUQ A r), 

g.2:((A, $< 2 >, A) jn8 , {$«X}, {$( 2 )} U Qjy), 

g.3 :((A, X, X) del ,{$^X,X$^},Qu), 

g.4:((A, $< 3 \ A) ms , {$< 2 )X, $( 3 \ X$«} U Q^), 

g.5:((A, $«, \) del , {$<, 3) $«}, Qx), 

g.6 :((A, X, A) dei , {$< 3 >X}, {$«} U Q N ), 

g.7:((A, $( 4 >, A) ms , }, {$« $< 4 >} U Q^), 

g.8:((A, %f\ X) del , {%[% 3) }, Qm), 

g.9 :((A, $< 5 \ A) ms , {$< 4 >}, {$( 2 >, $( 5 \ X%^} U Q^), 
g.lO:((A, Y, \) ms ,{$^%f\$f%f},{Y$^}yjQ N ), 
q.ll :((A, Y, \) ins ,{%WYM?\$fHf},{§fY}UQ N ), 
g.l2:((A, £, A) ms , {$< 4 >Y, Y%f\ {Z$^} U Q^), 

g.l3:((A, Z, X) ms , {$< 4 >Y, Y$f\ %fz, Z%f}, {%fz} U Q N ), 
g.l4:((A, A) de; ,{$( 4 )y,F$( 3 ),$( 3 )Z,Z$( 5 )},Q A ,), 
g.l5:((A, $W, A) de ,, {$< 4 >Y, Z$( 5 >}, {$< 3 >} U Q^), 
g.l6:((A, Sf, A) de; ,0,{$( 3 ),$( 4 )}UQ^), 

• we also add the rules 

{((A, P, A) de ,, 0, Vr\ ({P, P} U {a, a \ a G T})), 
((A, £7, A) de ,,0,{P})} 

u{((A, a, A) de ,,0,{p,p» |oey\r}. 

We will start our explanations with analyzing the third and simplest group of rules. These rules 
permit to consecutively erase symbols P, E and x, where x G T. The first forbidding condition ensures 
that the group is applicable only for strings of form w = B(xy)*E, x,y G T. So the result of the 
application of this group of rules on w is a projection of w over V. 

We will now turn to simulating the erasing random context rule p : (X — > X,P,Q). The correct 
simulation sequence is as follows: 

WiXXw 2 ^=> Wi# p iliii2 =^=> w 1 # p Xw 2 =^4> wi# p w 2 =^4> W1W2, 

where 101,102 G V£ ■ Note that if w\XXw2 satisfies the normalization condition, then w\W2 also satisfies 
it. 

It turns out that almost any other derivation sequence will result in an unproductive string with 
non-terminals. Indeed, the permitting context of rule p.2 will make it inapplicable if # p has not been 
properly positioned by p.l. Further, if p.2 erases the wrong X (i.e., not the X located immediately to 
the right of # p ), then the string will not satisfy the normalization condition and the computation will 
halt at a string with non-terminals. The contexts of the rule p.3 assure that X is erased only after the 
correct instance of X has been erased by p.2. Note that the forbidding word X# p prevents p.3 from 



being applicable if p.l inserts # p between X and X. Finally, if p. 4 erases # p when only X has been 
erased, the string will no longer satisfy the normalization condition; on the other hand, if # p is erased 
immediately after it has been inserted, no effect is produced at all. 

Now observe that, after the applications of p.l, p.2, and p.'i two valid scenarios are possible. If 
w 2 ^ XXw' 2 , w' 2 G Vp*, then the only applicable rule is pA, because the # p blocks applications of any 
other rules. If, however, w 2 = XXw' 2 , then the rule p.2 is applicable again, together with pA. Thus, 
instead of erasing # p by pA, the system may erase one more pair XX. This is not a problem, though, 
because this would correspond to the simulation of another application of the rule p. 

Note that it is not necessary to check for the permitting and forbidding contexts of the original 
rule p (P and Q correspondingly) before the application of the rule p.2, even if p.2 may start as many 
simulations of the erasing rule p as there are consequent instances of XX at the locus marked by # p . 
The reason is that p.2 is only applicable after an application of p.l (which adds a # p ), and p.l includes 
both P and Q in its contexts already. 

Quite as expected, the simulation of the generic context-free rule q : (X — > YZ, P, Q) is the most 
complicated. As usual, we will start with showing the correct simulation sequence: 

Wl XXw 2 ^4 W1 $WXXW2 =^4 w 1 $ q 1) XX$ q 2) w 2 ^4 Wl $Wx$Ww2 

=^4 Wl $f$Wl$f) ffl2 Wl $^X$^w 2 J4 Wl $^w 2 

^ Wl %^%fH^w 2 J4 Wl $^w 2 Wl $f>$^w 2 

^ Wl $^Y^H^w 2 Wl $^YY$^w 2 q =% Wl $^YY%^Z%^ W2 
Wl %^YY%fZZ%fw 2 m Wl %^YYZZ%^w 2 ^ Wl YYZZ$^w 2 ^ Wl YYZZw 2 . 

Here, again, w\,w 2 € V£ ■ We remark that if w\XXw 2 satisfies the normalization condition, all the 
strings shown in this derivation sequence satisfy the normalization condition as well. 

We claim that any derivations aside from the one shown above result in unproductive branches. In 
the next paragraphs we will briefly expose the rationale behind this statement. 

It should be clear by now that the principal way to control the loci of insertions and deletions is 
to carefully construct the permitting and forbidding contexts of the rules which are to be applied next. 
Consider the contexts of q.2, which compel the rule q.l to insert % q exactly before an instance of X. 
Similarly, the application of q.2 will produce a valid string only $ q 2 ^ is inserted after an instance of X. 
Note that at the moment we have not assured yet that $^ and $^ arc located before and after the 
same pair XX. 

Once the symbols $q and have been inserted into the string before and after some pairs XX, 
the rule q.3 will erase an instance of X; the normalization condition will assure that the erased symbol 
was located exactly after $ < q 1 \ 

The contexts of the rule qA guarantee that the proper instance of X has already been erased (the 
permitting condition $ < q 1 * > X) and will block the rule in the cases when S^ 1 -* is inserted between X and X 
and when $ < q 2 * > is inserted after $^; in the latter case q.3 would be blocked. 

The rule q.b assures the proper localization of $ q 3 \ i.e., before $ < q 1 \ The effect of rules qA and q.5 is 
thus to substitute $ < q 1 " > with $ q 3 \ 

The rule q.6 finalizes the erasure of XX. X cannot be erased before Sg 1 - 1 is erased, which also 
guarantees in the long run that $ q 3 ^ has been properly positioned. 

In the good case we would now expect to have $g 3 ^$q 2 ' 1 ; however, we have not yet assured that the 
symbol $ q was inserted after the same pair XX as the pair before which $\ ' was inserted. This is the 
job of the rule q.l: the symbol %\p is only inserted when there actually is a pair S^Sq 2 -* in the string. If 
this is not the case, $^ will not be inserted; it is easy to verify that the subsequent rules will be rendered 
inapplicable in this case, thus blocking the computation at a string with non-terminals. 

The rule q.8 assures proper positioning of $ q 4 \ Again, if $^ is not erased here, all further rule 
applications will be blocked. 

The rule q.9 inserts yet another service symbol $ q 5 \ The contexts of this rule assure that it is only 
applied at the proper time. The contexts of q.10 assure the proper positioning of % q 5 \ An application 
of the rule q.10 inserts an instance of Y and, by the normalization condition, this will not block any 
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computations only if Y is inserted to the left or to the right of a service symbol of the $ family. The 
contexts of the rules q.10 and q.ll assure that Y is placed exactly between $ < q 4 * > and $ q 3 ^ . Similarly, the 
contexts of q.ll and q.12 guarantee the proper positioning of Y. And again, the situation is exactly the 
same for the rules q.12 and (7. 13, which are guaranteed to place the corresponding symbols at the proper 
sites. 

A very important remark is that, once we have more than one non-special symbol between two special 
symbols (i.e, the symbols of the $ family) , the normalization condition becomes relevant for the substring 
between the special symbols. This limits the number of times the insertion rules q.10 to q.13 can be 
applied. 

The contexts of the rule q.lA provide the final verification of the configuration of the string around 
the symbols $ q 4 \ $f\ and $ q 5 \ Once Sg 3 '' is erased, q.15 erases $ q 4 K Note how the contexts of q.15 only 
allow this rule to be applied after the $ q has been erased. Finally, q.16 finalizes the clean-up of the 
string. 

Now in order to finish the proof of the theorem we show the inclusion SC 2 ,2lNS°' DEL°{ C RE. 
Since INS°'°DEL°{° C CF (see [29]) we obtain the inclusion SC 2a IN S°{° DEL°{° C SC 2 ,2 Q RE. □ 



4 Random Context Insertion-Deletion Systems 

In this section we consider random context insertion-deletion systems. We show that such systems having 
rules of size (2, 0, 0; 1, 1, 0) are computationally complete, while systems of size (1, 1, 0;p, 1, 1), p > 0, are 
not. 

4.1 Computational Completeness 

We start by showing the computational completeness of the family RC I N S®' D E L\ ,q . 
Theorem 3. RC IN DEL X {° = RE. 

Proof. We show only the inclusion RC INS^DEL^ D RE. For the converse inclusion we invoke the 
Church- Turing thesis. We will simulate an arbitrary grammar in the special Geffert normal form. Let 
G = (V, T, S, R) be such a grammar. We will simulate G with the following random context insertion- 
deletion system T = (V r ,T, {S}, R r ). 

The alphabet Vr is constructed in the following way: Vp = V U Ar ■ The set Ar contains the non- 
terminal service symbols we add to V. These symbols are the hatted and barred version of A, B, C, and 
.D, as well as the additional symbols of $ r , # q , and f r families, added on per-rule basis, as shown in the 
following paragraphs. 

The set of rules Rr is constructed in the following way: 

• for every rule (p : X — > cY) e R, c e T we add the following to Rr- 

{ p.l: ((A, cY, X) ins ,{X},(N'UNr)\{X})), 
p.2: ((y, X, A) dei ,0,0) }, 



• for every rule (p : X — > ZY) E R, Z G N we consider the rule p' : (X —> ZZY); this rule can 
be rewritten as {X — > ZX',X' — > ZY}; we then apply the same reasoning as in the previous 
paragraph, 

• for every rule (q : X —> Yc) E R, c E T we add the following to Rr- 

{ q.l: ((A, X) ins ,{X},(N'UN r )\{X}), 

q.2: (#' ? , X, A) de; ,0,0), 

q.3: ((A, Yc, A) ms , {# J , {A, y}), 

qA: (c, A) de/ ,0,0), 

q.5: ((A, #„ A) de ,, 0, {#,}) }, 



• for every rule (q : X — > YZ) E R, Z E N we consider the rule q' : (X — > YZZ); this rule can 
be rewritten as {A — > X'Z,X' — > YZ}; we then apply the same reasoning as in the previous 
paragraph, 

o 



• for every rule (r : UV ->• A) G R, (U, V) £ {(A, B), (C, D)} we add the following to R r : 
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Consider the application of the rule p : X — >• cF to wiXw 2l w\,w 2 6 (TU TV)*. Remember that, for 
any string w generated by a grammar in the special Geffert normal form, the following holds: \w\z < 
1,VZ G AT'. 

The correct sequence of T events simulating the rule p is as follows: 

w\Xw 2 => wicYXw 2 ==i wicYw 2 - 

Note that p.l will only be applicable when there is one copy of X in the string and where there is no 
other symbol from N' U iVr- This, among others, prohibits starting the simulation of rule p if another 
rule is being simulated. 

Now suppose that cY is inserted in a different place in the string (not before X). In this case p.2 
will not be applicable in the new configuration and both X and Y will remain in the string. Since a 
simulation of a rule cannot start when there is more than one symbol from N' in the string, the derivation 
halts at this invalid string. 

The simulation of the rule q : X — > Yc takes more steps. The reason is that we use deletion rules to 
assure proper location of insertion sites, and deletion rules have left contexts, while in the string Yc the 
symbol c is to the right of Y. 

The correct sequence of T events is the following. 

W 1 Xw 2 =^4> Wl# q # q Xw 2 => Wl# g #' q W2 =^4> Wl# q Yc#' q W2 

=?4> Wi# q Y 'cw 2 =?4> wiYcw 2 . 

We will now focus on analysing the situations when the insertions do not happen at desired loci. 
Suppose that q.l does not insert before X. In this case q.2 is not applied. The rule q.3 cannot 

be applied, because there is an instance of X in the string. The rule qA cannot be applied, because #^ 
is located immediately after # g . The rule q.5 is inapplicable, because there is an instance of #^ in the 
string. Therefore, if is not inserted right before X, all three symbols from N' (namely, X, # q , 

and #' q ) remain in the string. This prevents any further simulation of rules and the derivation halts at 
an invalid string. 

The other insertion rule is q.2>. Suppose that Yb is inserted in a place other than between # q and 
■#! q . In that case the rule qA will never be applied and neither will q.5 (because the symbol qA will not 
be deleted). The string will contain three symbols from N' U TVr (Y, # g , and #' q ), which makes any 
further simulation impossible. Again, the derivation halts at an invalid string. 

The simulation of the rule UV — > A is the longest. The two problems with this rule are that we need 
to remove the symbols U and V only if they are located in a certain order and that we need to remove 
exactly one instance of U and one instance of V. 



The correct simulation sequence is as follows: 



Ul uuvvu 2 =^4 mmj'rUvvui ^4 Ul m' r uvvu 2 ^> u 1 u$';$' r uvvu 2 
=^4 Ul u$';uvvu 2 ^4 Ul m';vvu 2 ^4 u 1 u% i ; , % i ;vvu 2 ^> Ul u$';'vvu 2 
=^4 Ul m'i'vu 2 ^4 uM'r'v !rf r u2 ^4 uM'I'v f r U 2 
^4 uiC/$< jv) $;"y/> 2 ^4 uil>$</ v) t//> 2 «i£/$^ v >/> 2 

^ Ul W r IV) U 2 U1 UU 2 , 

where ui, u 2 G (T U {[7, ?7 | [/ G {A, 5, C, D}})* . Note that r.18 will eventually erase all instances of U 
when the string will only contain symbols from T U {A, C}. 

The simulation of the erasing rule starts with fixing a $^ at the site between an instance of U and 
U. An instance of $" is then inserted before and erases both $' r and the U which is located next 
to it. Next is inserted, which erases both $" and the V which should follow immediately in a valid 
string. Then we fix a site immediately after a V by inserting f r f' r and having V erase f r . This allows 
to eventually insert $ ( r IV} whose mission is to erase and a V. We demand that $ ( r IV) erase f' r in order 
to assure that, on the one hand, $ { r IV) has erased V and, on the other hand, that f' r is right after the 
currently processed sequence UUVV. 

Note that, when erasing a sequence UUVV, we do not erase the U. Instead, we rely on rule r.17 
to do that when the string only contains terminals and, possibly, hatted versions of A and C. It is 
not necessary to erase U because, by erasing the pairs UV and discarding U and V, we already do 
the necessary checking of whether the non-terminals {A, B, C, D} are organised in proper pairs. The 
only situation when the left-over instance of A or C can interfere with other simulations of the rule 
r : UV — > A is when there are nested pairs, like AACCDDBB. In this case, the inner pair will be erased 
first, leaving a C. When the corresponding to the rule AB — > A will need to erase $" and B from 
the string, it can also remove the left-over C by rule r.10. Thus, the garbage symbols A and C cannot 
disrupt the simulation of other erasing rules. 

On the other hand, observe that is programmed to erase either $"V, or $"AV, or $"CV. Two 
hatted symbols one after another cannot be produced after the simulation of the context-free rules of 
the grammar, because those rules only produce pairs of hatted and barred symbols. This means that 
will successfully handle the garbage symbols resulting from the formerly existent nested pair, but will 
not be able to produce unwanted effects in other parts of the string. 

We will explicitly remark that, according to the prohibiting set of r.l, the simulation of a rule UV A 
can only start once the simulation of all other types of rules have completed. 

We will now try to see whether it is possible for the described simulation process to produce wrong 
results. Suppose that r.l does not insert the pair $ r $^. after U. In this case r.2 will not be applicable. 
Furthermore, the rule r.3 will not be applicable either, because the symbol $ r is still in the string. 
Obviously, the rules r.4, r.5, and r.6 cannot be considered because there is no symbol Since the 
symbol is not inserted either, the rules r.7 to r.ll arc inapplicable. Because the pair f r f' r is not 
inserted, the rules r.12 and r.13 cannot be applied. Finally, because no rule inserts $$/ v) , the rules r.14 
to r.17 arc not applicable either. Nevertheless, the symbols $ r and %' r are still in the string, which makes 
any other simulations impossible, which renders the whole branch of computation invalid. 

Similar considerations apply to the insertion rule r.3. In the case $" does not arrive at the proper 
site, %' r is not removed, which blocks further applications of rules. In an analogous way, the insertion of 

at an improper site renders the simulation invalid. And for the same reasons, the insertion of %\! v) 
anywhere but immediately before blocks the computation. Note that, before allowing a the symbol 
$i l ^ to erase the corresponding hatted or barred version of U or V, we demand that it erase the symbol 
$r In this way we guarantee that the symbols of the $ r family are always inserted inside the same 
sequence of UUVV as the one that was fixed by the initial insertion of $ r $' r . 

The only insertion rule left is r.ll. The rule r.12 and the forbidding contexts of the following rules 
guarantee that the symbol f' r is located after an instance of V . There is however no guarantee that this 
will be the instance of V which belongs to the processed sequence UUVV . Suppose that f r is indeed 
inserted after a different instance of V. In this case the rule r.16 will be inapplicable, because there can 
be only one instance of f r in the string, and this instance is positioned improperly. This will not allow 
% ( * V) to eventually be erased by the rule r.17 and will therefore block any further computations. 

□ 
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4.2 Computational Incompleteness Results 



In this section we will show that random context insertion-deletion systems of size (1, 1,0; p, 1,1) are not 
capable of generating all recursively enumerable languages, hence they are computationally incomplete. 

The proof below is based on the observation that it is impossible to control the number of applications 
of an insertion rule r : ((a;, y, A)j„ s , P, Q) when y is already present in the string. 

Obviously, in the case P = Q = 0, if the rule can be applied once, it can also be applied any number 
of times. We also remark that including x in either P or Q gives no advantage; including x in the 
permitting context is redundant, while including it in the forbidding context would just make the rule 
never applicable. Hence, there are two remaining cases: y Q, and y 6 Q. When y j£ Q, the situation 
is exactly the same as with P = Q = from the point of view of controlling the number of applications. 
On the other hand, when y 6 Q, the rule r will only be applicable when the string contains no y. So, 
when symbol y is already present in the string, rule r can be applied any number of times. 

Theorem 4. RE\RC IN S\ fi DEL 1 / ^ 0. 

Proof. We show that the language L = (ab) + cannot be generated with such systems. We shall do the 
proof by contradiction. Suppose that there is a random context insertion deletion system T = (V, T, S, R), 
r £ RCINSi DEL 1 ' 1 , such that L(j) — L. Without restricting generality, we will not allow R to 
contain rules erasing terminal symbols, see pQ or [3D] for more details. 

Consider a derivation S ===> w, where w £ (ab) + . We will focus on a pair of symbols ab: w = W\abw2, 
where 101,102 £ (ab)*. 

Since R does not contain rules erasing terminals, the instance of b which belongs to the current pair 
was either inserted at this site by an insertion rule or was a part of the axiom S. We will not examine 
this latter case, and will instead suppose that the word w is long enough. The rule which inserted b is 
of the form p : ((x, b, X)i ns , P, Q). It is very important to remark at this point that the permitting and 
forbidding contexts of the rule p cannot influence the position at which the insertion takes place. We are 
therefore not interested in the case x = A, because such a rule would be allowed to insert the b at any 
site on the string, and, in particular, we could obtain the string w' = w\baw2 $jL (ab) + . 

So, we have the following derivation: 

S ==>• w[xw 2 ==> WiXbw'2 =4- W\abw2 = W. 

where w' 1 ,w , 2 £ V*. Since w £ (ab) + we can suppose that \w[\b > (as well as \w' 2 \b > 0). In this case, 
according to our remark above, rule p can be applied an arbitrary number of times yielding the following 
derivation: 

S ==> w[xw 2 => Wixbw'2 ===>■ w' 1 xb + w 2 w±ab + W2- 

We observe that the last part of the derivation is possible because a deletion rule of size (p, 1, 1) 
cannot distinguish if there is one or more b present in the left or right context. 

We therefore conclude that, if S =>■ wiabw2, then S => w\ab + W2 and L(T) =/= (ab) + , which 
contradicts our initial supposition. □ 

5 Conclusion 

In this paper we introduced the mechanism of conditional application of rules for insertion-deletion 
systems based on semi-conditional and random context conditions. This mechanism permitted to achieve 
an increase in the computational power, especially for the case of semi-conditional insertion-deletion 
systems, which are showed to characterize the family of recursively enumerable languages with rather 
simple rules: context-free insertion or deletion of one symbol. This shows that the control mechanism 
is in some sense more powerful than the graph control, because in the latter case only the recursively 
enumerable sets of numbers can be generated [2 . The form of the rules and of the control mechanism 
permit to consider it as a particular case of Networks of Evolutionary Processors [5], so the obtained 
result can be transcribed in that area as well. 

In the case of random-context conditions we obtained results exhibiting an interesting asymmetry be- 
tween the computational power of insertion and deletion rules: systems having rules of size (2, 0, 0; 1, 1, 0) 
are computationally complete, while those having the size (1, 1, 0; 2, 0, 0) (and more generally of size 
(1, 1, 0;p, 1, 1)) are not. This result is surprising because until now all the obtained results in the area of 
insertion-deletion systems with or without additional controls were analogous if parameters of insertion 



and deletion rules were interchanged. The characterization of these classes gives an interesting topic for 
a further research. 

We remark that all the constructions were additionally tested using a self-developed dedicated sim- 
ulator that is available "on request" from the corresponding author. These tests permitted to verify 
that for a single rule application, all possible evolutions except the one corresponding to the correct one 
introduce in the string (groups of) non-terminal symbols that cannot be removed anymore. 
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